An Empirical Analysis of the Open Source Development Process Based on Mining of Source Code Repositories
نویسندگان
چکیده
This paper presents an empirical analysis of the Open Source development process from the point of view of the involvement of the developers in the production process. The study focuses on how developers contribute to projects in terms of involvement, size and kind of their contribution. Data have been collected from 53 Open Source projects and target application domains include different areas: web and application servers, databases, operating systems, and window managers. Collected data include the number of developers, patterns of code modifications, and evolution over the time of size and complexity. The results of this study show evidence that there are recurrent patterns in Open Source software development and these patterns are common to all the projects considered even if there are no superimposed processes for development, application domains are different, and there are contributions from people spread across the world.
منابع مشابه
Modeling Modern Code Review Practices in Open Source Software Development Organizations
Many Open Source Software (OSS) communities has adopted peer code review as an effective quality improvement practice. The informal, regular, and tool-based code review process has been called ’modern code review’. There has not been much research on the modern code review process. This dissertation aims to model the this code review process adopted by the software organizations through 1)under...
متن کاملSoftware Repositories: A Source for Traceability Links
This paper analyzes six open source projects in order to assess software repositories, such as those managed by Subversion, as a source for uncovering/discovering traceability links between different types of software artifacts. Our finding suggests that software repositories store a variety of artifacts that are central to open source development and use. Furthermore, a heuristic-based approac...
متن کاملData Mining for Software Process Discovery in Open Source Software Development Communities
Software process discovery has historically been an intensive task, either done through exhaustive empirical studies or in an automated fashion using techniques such as logging and analysis of command shell operations. While empirical studies have been fruitful, data collection has proven to be tedious and time consuming. Existing automated approaches have expedited collection of fine-grained d...
متن کاملMining Software Repositories to Guide Software Development
Software repositories (such as source control repositories) contain a wealth of valuable information regarding the evolutionary history of a software project. In this research we recover such historical data and present several techniques and approaches to guide managers and developers working on large software systems. We validate our work empirically using data based on over 60 years of devel...
متن کاملMining the Categorized Software Repositories to Improve the Analysis of Security Vulnerabilities
Security has become the Achilles’ heel of most modern software systems. Techniques ranging from the manual inspection to automated static and dynamic analyses are commonly employed to identify security vulnerabilities prior to the release of the software. However, these techniques are time consuming and cannot keep up with the complexity of ever-growing software repositories (e.g., Google Play ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- International Journal of Software Engineering and Knowledge Engineering
دوره 17 شماره
صفحات -
تاریخ انتشار 2007